An introduction to stochastic approximation
نویسنده
چکیده
We propose to start the exposition of the topic by an example. The arguments are given in a crude manner. Formal proofs will be given in section 2. This example is taken from the very article [6] which introduced stochastic approximation. Consider x ∈ R the parameter of a system and g(x) ∈ R an output value from this system when parameter x is used. We assume g to be a smooth, increasing function. An agent wants to determine sequentially x∗ ∈ R the value such that the system output equals a target value g∗. If for all x, the value of g(x) can be observed directly from the system, then determining g∗ could be solved by a simple search technique such as binary search or golden ratio search. Here we assume that only a noisy version of g can be observed. Namely, at time n ∈ N, the decision maker sets the parameter equal to xn, and observes Yn = g(xn) +Mn with Mn a random variable denoting noise, with E[Mn] = 0. In order to determine g(x), a crude approach would be to sample parameter x repeatedly and average the result, so that the effect of noise would cancel out, and apply a deterministic line search (such as binary search). [6] proposed a much more elegant approach. If xn > x ∗ , we have that g(xn) > g ∗, so that diminishing xn by a small amount proportional to g −g(xn) would guarantee xn+1 ∈ [x∗, xn]. Therefore, define n a sequence of small positive numbers, and consider the following update scheme: xn+1 = xn + n(g ∗ − Yn) = xn + n(g − g(xn)) + nMn.
منابع مشابه
APPROXIMATION SOLUTION OF TWO-DIMENSIONAL LINEAR STOCHASTIC FREDHOLM INTEGRAL EQUATION BY APPLYING THE HAAR WAVELET
In this paper, we introduce an efficient method based on Haar wavelet to approximate a solutionfor the two-dimensional linear stochastic Fredholm integral equation. We also give an example to demonstrate the accuracy of the method.
متن کاملOptimization of the Inflationary Inventory Control Model under Stochastic Conditions with Simpson Approximation: Particle Swarm Optimization Approach
In this study, we considered an inflationary inventory control model under non-deterministic conditions. We assumed the inflation rate as a normal distribution, with any arbitrary probability density function (pdf). The objective function was to minimize the total discount cost of the inventory system. We used two methods to solve this problem. One was the classic numerical approach which turne...
متن کاملStochastic Approach to Vehicle Routing Problem: Development and Theories
Stochastic Approach to Vehicle Routing Problem: Development and Theories Abstract In this article, a chance constrained (CCP) formulation of the Vehicle Routing Problem (VRP) is proposed. The reality is that once we convert some special form of probabilistic constraint into their equivalent deterministic form then a nonlinear constraint generates. Knowing that reliable computer software...
متن کاملApproximation solution of two-dimensional linear stochastic Volterra-Fredholm integral equation via two-dimensional Block-pulse functions
In this paper, a numerical efficient method based on two-dimensional block-pulse functions (BPFs) is proposed to approximate a solution of the two-dimensional linear stochastic Volterra-Fredholm integral equation. Finally the accuracy of this method will be shown by an example.
متن کاملOptimal flexible capacity in newsboy problem under stochastic demand and lead-time
In this paper, we consider a newsvendor who is going to invest on dedicated or flexible capacity, our goal is to find the optimal investment policy to maximize total profit while the newsvendor faces uncertainty in lead time and demand simultaneously. As highlighted in literature, demand is stochastic, while lead time is constant. However, in reality lead time uncertainty decreases newsvendor's...
متن کاملAPPROXIMATION OF STOCHASTIC PARABOLIC DIFFERENTIAL EQUATIONS WITH TWO DIFFERENT FINITE DIFFERENCE SCHEMES
We focus on the use of two stable and accurate explicit finite difference schemes in order to approximate the solution of stochastic partial differential equations of It¨o type, in particular, parabolic equations. The main properties of these deterministic difference methods, i.e., convergence, consistency, and stability, are separately developed for the stochastic cases.
متن کامل